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Abstract 

This paper presents a new minimum-weight 
trellis-based soft-decision iterative decoding al- 
gorithm for binary linear block codes. Simu- 
lation results for the RM(64,22), EBCH(64,24), 
RM(64,42) and EBCH(64,45) codes show that the 
proposed decoding algorithm achieves practically 
(or near) optimal error performance with signif- 
icant reduction in decoding computational com- 
plexity. The average number of search iterations 
is also small even for low signal-to-noise ratio. 

1 Introduction 

Recently Moorthy et. al.[2] have proposed a 
zero-and-minimum-weight subtrellis-based itera- 
tive decoding scheme for binary linear block codes 
to achieve a very good trade-off between error 
performance and decoding complexity. In the 
scheme, all the candidate codewords are gener- 
ated by an algebraic decoder based on a set of 
test error patterns proposed by Chase[l]. The 
zero-and-minimum-weight trellis search around 
the current best candidate codeword c is per- 
formed at most once, only if (i) a sufficient condi- 
tion that the optimal solution is within the min- 
imum distance from c holds or (ii) all the test 
error patterns have been exhausted and no can- 
didate codeword satisfies the sufficient condition 
for optimality. 

For the proposed decoding algorithm in this 
paper, preliminarily presented in [6], the initial 
candidate codeword is generated by a simple de- 
coder, the zero-th order or the first order decod- 
ing proposed in [4]. The subsequent candidate 
codewords (if needed) are generated by a chain of 
minimum- weight trellis searches. This minimum- 
weight trellis around a candidate codeword c con- 
sists of only the codewords in code C that are at 
the minimum distance from c, but does not in- 
clude c. The decoding iteration stops whenever a 
candidate codeword is found to satisfy a sufficient 
condition for optimality or the latest minimum- 
weight trellis search results in a repetition of a 
previously generated candidate codeword. Let 
this decoding algorithm be denoted Algorithm I- 


w\. The decoding process terminates faster than 
the Moorthy et. al. algorithm. Furthermore, the 
use of minimum-weight trellis search considerably 
reduces the possibility of being trapped into a lo- 
cal optimum. As a result, it achieves better 
error performance than the Moothy et. al. algo- 
rithm. 

A necessary condition for Algorithm I-uq to 
achieve good error performance is that the min- 
imum weight codewords span the entire code. 
Reed-Muller(RM) codes satisfy this condition. 
Simulation results for the RM(64,22)(the (64,22) 
RM code), RM(64,42) and the EBCH(64,45)(the 
extended (64,45) BCH code) codes show that the 
proposed decoding algorithm practically achieves 
optimum MLD performance even in the range of 
relatively low SNR. The EBCH(64,24) code is an 
example for which the above necessary condition 
does not hold. For this code, the first and second 
minimum weight codewords span the entire code. 
In this case iterative decoding algorithm based 
on the first and second minimum-weight trellis 
search, denoted Algorithm \-w 1 -w 2 , is used. 

We also propose another approach to overcome 
the problem. Let Co be a linear subcode of C and 
assume that the minimum weight codewords of Co 
span Co. The decoding scheme is a combination 
of: (1) the iterative search using the minimum- 
weight trellis for Co around the latest candidate 
codeword, and (2) a procedure for moving from 
the coset of Co in C, containing the current candi- 
date codeword, to another coset which is likely to 
contain the optimal solution. Simulation results 
for the EBCH(64,24) code show that this scheme 
achieves better error performance than Algorithm 

\-Wi~W2' 

2 Sufficient Conditions for Op- 
timality 

Suppose a binary (JV, K) linear block code C is 
used for error control over the AWGN channel 
using BPSK signaling. Let z = ( 21 , 22 , . . . , zn) be 
the binary hard-decision sequence obtained from 
the received sequence r = (rj., 7*2, . . . , r^v)- 

Let Vn denote the vector space of all binary 
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iV-tuples. For an A r -tuple u = ( 14 , 1 * 2 - . . u n) G 
Vjy, let L(u) be defined a s follows: 

L(u)= hi- (1) 

{i:u t ^z t and 

L(u ) is called the correlation discrepancy of u 
with respect to z , and the smaller L(u) is, the 
larger the correlation between u and r is. For 
u and v G V /y, u is said to be better than v 
if L(u) < L(u). For a nonempty subset X of Vjv 
and a positive integer /i, let h! denote min{|A|, h} 
and let best^X denote the set of the h f best 72- 
tuples in X , that is, for any u G best/* A and 
v £ X — best^A, L(u) < L(v). The best n-tuple 
in X will be denoted best A. 

Let d/f(u, v) denote the Hamming distance be- 
tween two AAtuples, u and v . For u\, U 2 , 
. .., Uh E V/v and positive integers dx, d 2 , . . M 
let V N {ui,d\\U 2 ,d 2 \^ .\Uh.dh) be defined 
as the set: {u G Vpj : dx{u y Ui) > d x for 

1 < i < /i}, and let L(ux, d\\ U 2 , d 2 \ • . . ; u/*, dh) 
be defined as the minimum of L(u) over u G 
V N (u 1 ,di]U 2 ,d 2 -,...;u h ,d h ). 

Then we have the following early termination 
condition of an iterative decoding algorithm with- 
out degrading the error performance. 

Lemma: At a stage of an iterative decoding al- 

gorithm for a block code B(C itself or a coset of 
a linear subcode of C), let GC denote the set of 
those candidate codewords which have been gen- 
erated already, and let Ubest denote the best of 
GC. Suppose that for U\, U 2 , . . ., G GC and 
positive integers dx, c? 2 , . . d^, Ub es t Is the best 

of u£_ 0 {tt G B : d//(tx, Ui) < d x }. If Ub es t satisfies 
the following condition, 

L{u b^t) < L{u 1 ,d l -,u 2 ,d 2 -...;u h ,d h ), (2) 

then tibest is the optimal MLD solution in B. AA 
Expressions for evaluating the right-hand side 
of (2) for h = 1,2 and 3 have been derived [3]. 
These bounds are used in our proposed iterative 
decoding algorithm for early termination condi- 
tions. 

3 Decoding Procedure 

Let C be a binary linear (AT, K) code with weight 
profile W = {0, wi,W 2 ’> . . and let Cq be a bi- 
nary linear (AT, Kq) subcode of C with weight pro- 
file W 0 = {0,1^01)^02) • * •}) where w\ is the mini- 
mum weight of C, woi is that of Co and w t < wq{. 
As a special case, Co may be C itself. 

3.1 Minimum- weight Subtrellis Search 

for a Coset 

For B G C/Co and u G B y minimum- weight 
search (u, B) denotes a search procedure for find- 


ing a codeword in B , denoted <pa(u), which has 
the least correlation discrepancy with respect to z 
among all codewords in B at the minimum Ham- 
ming distance wq\ from u. That is, 

= best{v G B : dn{v,u) = u?oi}< (3) 

If Cq is spanned by the set of minimum weight 
codewords of C, that is, Co = C or C is an 
UEP(unequal error protection) code, then woi = 
w\ and 

*PB{y) — <pc(u) — best{u G C : dn(v y u) — u>x}. 

w 

This search procedure is implemented by using 
the minimum- weight subtrellis of Cq around u. 
This minimum-weight subtrellis is sparsely con- 
nected and much simpler than the full trellis of 
the code[2, 6]. 

Iterative minimum- weight search {u,B) is to 
generate a sequence of candidate codewords, 
<pa(u), **•> until a certain termina- 

tion condition holds. It is shown in [6] that 

LW'b 2 ( u )) < L{<p l B {u)), for i > 0, (5) 

where <p%(u) = u and cp l B l (u ) = (p((p l B (u)) for 
i > 0, and that if L(<p l B 2 (u)) < L((p B (u)) for 
0 < i < 7, then ^(u) with 1 < i < I are all 
different. If j is the smallest index such that 

%i- 2 («)) = fKW), “ (6) 

then 

min {L(ip l B (u)) : 0 < i < j} — 

min {L(^ _2 («)),L(^ _1 (u))}. (7) 

The condition (6), denoted Cond#, is used as one 
of termination conditions for Iterative minimum- 
weight search (u,B) to avoid repetition. 

3.2 Decoding Procedure for C 

Suppose the set of codewords of weight u>ox i n Co 
spans Co and K — Kq is not large. We propose 
a new decoding procedure for C which consists 
of iterated minimum-weight searches in a coset 
and coset shiftings. Two early termination condi- 
tions, Condc for the entire procedure and Conds 
in the subprocedure for a coset B G C/Co, 
used besides the termination condition Cond^ in 
the subprocedure for a coset. 

Condo is sufficient condition [3] that the best 
candidate codeword, denoted Ub es t) se * of 

those candidate codewords which have been gen- 
erated already, denoted GC, is optimum based on 
best^CC and the weight profile W of C, where h 
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is a specified small integer. From the Lemma, 
Condc is defined as 

L{u best) < L(u 1 ,d 1 ]u 2 ,d 2 -,. ..;ui,di), (8) 

where l = min{/i, |GC|} and for 1 < i < l, U{ E 
best hGC and if <pc( u i) £ GC, then d t = w 2 and 
otherwise, di = W \ . 

Cond# is a sufficient condition that there re- 
main no codewords in B better than tibest* 
This condition is also based on best^GCe where 
GCq = GC fl B and the weight profile Wo of Co 
which is the same as the distance profile of any 
coset of C/Co. From the Lemma, Cond# is de- 
fined as 

T(^best) — •> ^li ^2? ^2) • * • J ^/bj di B ), (9) 

where Ib = min{/i, |GCb|} and for 1 < i < Ib, 
U{ G best^GCa and if <PB(ui) £ GCg, then dj = 
W 02 and otherwise, d{ = wqi . For instance, if the 
minimum or the second minimum weight of Cq 
is greater than that of C, then Cond# is more 
effective than Condc only. 

In the procedure, global variables GC h and 
GCb } h are used besides GC and ttb est . GCh de- 
notes the current value of best^GG and GCjg^ 
denotes the current value of best/jCC^. For 
B G C/Co, lot /(r,j9) denote the initial candi- 
date codeword in B for a given received sequence. 

3.2.1 Decoding Algorithm II 
We assume that z £ C. 

(Dl) (i) Generate f(r y B) for all B G C/Co, and 
number the 2^“^° cosets of C/Co in the increas- 
ing order of correlation discrepancy, L(f(r,B)). 
(ii) Initialize GC «— {f(r,B) : B G C/Co}, 
libest bestGC and GCh best^GC. If Condc 
holds, then output “Ubest and stop. Otherwise, ini- 
tialize GC b , GC B) h <- for 5 G C/C 0 , 

and perform Search-in(the first coset). 

(D2) Search-in(B): Execute Iterative minimum- 
weight search(/(r, B) y B ) together with updat- 
ing the global variables each time a new candi- 
date codeword is generated until either Cond^, 
Condc or Cond# holds. If Condc holds, then 
output Ubest and stop. Suppose that Cond# or 
CondB holds. If all cosets have been exhausted, 
then output ttbest and stop; and otherwise, call 
Search-in(the coset next to B ). 

(D3) If either Condc ° r Cond^ holds for every 
coset in C/Co, the output is optimum. 

For the special case where C — Co, this decod- 
ing algorithm becomes Algorithm I-uq. 

3.3 Choice of the Initial Canidate 
Codeword f(r y B) for 5 G C/C 0 

For a given received sequence r = (ri,r 2 , . • . , rjy), 
let Mk be the location set of the most reliable 


basis of the column space of a generator matrix of 
C, and let A n~K 0 be the location set of the least 
reliable basis for the column space of a parity- 
check matrix of Co. Then it follows from Theorem 
1 in [5] that 

I Mk n A/v_*: 0 | = K - Kq. (10) 

For x = (xiy X 2 , • . • , xjv) € Vpj and a coset B G 
C/Co, a codeword u = (ui, U 2 , . . . , ujv) £ B sat- 
isfying the following condition is uniquely deter- 
mined: 

Ui = for all i G Mk — A jv_/c 0 . (11) 

Let g(x y B) denote the above codeword u in B. 

Then g(z,B) can be chosen as the initial candi- 
date codeword f{r,B) for B = C/Co, where z is 
the hard-decision binary vector obtained from the 
received sequence r. This g(z y B) for B = C/Co 
is a simple generalization of the zero-th order de- 
coding proposed by Fossorier and Lin[4] to a coset 
of Co in C. Similarly, a generalization of their first 
order decoding can be used. 


4 Examples 

Example 1: Let C = EBCH(64,24) and Co = 
RM(64,22). Then uq = 16, u >2 = 18, woi = lb 
and wq 2 = 24. Decoding Algorithm II, where 
early termination condition Cond# is not used, 
has been simulated for this code. The simulation 
results are shown in Figures 1 and 2. For compar- 
ison, the simulation results of Algorithm I-W 1 -W 2 
for this code are also shown. Figure 1 shows the 
bit error probabilities. We see that Algorithm 
II practically achieves optimal error performance. 
Figure 2 shows the average numbers of opera- 
tions(addition and comparison) of Algorithm I- 
W 1 -W 2 and Algorithm II. The numbers depend on 
the complexities of subtrellises used in the sim- 
ulation. The minimum weight subtrellis of the 
RM(64,22) code used in Algorithm II and the first 
and second weight subtrellis of the EBCH(64,24) 
code used in Algorithm I-uq-u/2 are obtained sim- 
ply by purging the 4-section full trellis diagrams 
of the codes. The construction of better subtrellis 
is under study. 

We see that the average number of operations 
of Algorithm II can be reduced by using Cond#. 
Example 2: Let C = Co = EBCH(64,45) with 
wi = 8 and W 2 = 10. For this case, the simula- 
tion results of Algorithm I -uq, where the initial 
candidate codeword is provided by the first order 
decoding in [5], are shown in Figures 3 and 4. 

Simulation results of the RM(64,22) and 
RM(64,42) codes are shown in [6], 
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Figure 1: Bit error probabilities for the 

EBCH(64,24). 



Figure 2: Average numbers of operations for 

EBCH(64,24). 
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Abstract — Maximum likelihood decoding of 
long block codes is not feasable due to large 
complexity. Some classes of codes are shown to 
be decomposable into multilevel concatenated 
codes(MLCC) [3j. For these codes, multistage 
decoding provides good trade-off between per- 
formance and complexity. In this paper, we de- 
rive an upper bound on the probability of block 
error for MLCC. We use this bound to evaluate 
difference in performance for different decom- 
positions of some codes. Examples given show 
that a significant reduction in complexity can 
be achieved when increasing number of stages of 
decoding. Resulting performance degradation 
varies for different decompositions. A guide- 
line is given for finding good m-level decompo- 
sitions. 

I. Introduction 

Multistage Decoding (MSD) is an efficient soft deci- 
sion decoding method for long decomposable codes, 
such as multilevel concatenated codes (MLCC) [1]. Al- 
though suboptimum in performance, it greatly reduces 
the computational complexity as compared to opti- 
mum decoding. 

Since its inception in Hemmati’s paper [2], Closest 
Coset Decoding (CCD) of \u\u + v| codes and general- 
izations of CCD have been investigated by several au- 
thors. However, only a few papers on the performance 
analysis of this method have appeared [3-6]. Further- 
more, these methods are restricted to |u|u -f* v\ codes 
and some others to block modulation codes. The dif- 
ferent code-structure of MLCC’s precludes application 
of the analysis therein to the case of MLCCs. 

In [3], an upper bound on the effective error coeffi- 
cient (EEC) for 2-stage decoding of MLCC’s was de- 
rived and some guidelines for choosing a good 2-level 
decomposition of Reed Muller codes were given. 

In this paper, we first derive an upper bound on block 
error probability of MSD of MLCC’s, when optimum 
decoding of each stage is performed. We first express 
the upper bound in terms of all the error coefficients, 
and then explain how these coefficients can be ob- 
tained using some combinatorial methods and weight 

^his work was supported by NSF grant NCR-9415374 and 
NASA grant NAG 5-931. 


distributions of inner and outer codes of a MLCC. The 
bound enables prediction of the performance of MSD 
(two or more levels) without simulation. Therefore an 
estimate of performance degradation, when the num- 
ber of decoding stages is increased, can be obtained. 

II. Analysis of CCD of Multilevel 
Concatenated Codes 

For an M-level code C = {Bj, B2, . . . , B a \/ } * 
{Ai, A 2 , . . . , A*/}, consider the i-th stage of decoding 
for 1 < i < M. At each stage, decoding is assumed to 
be complete (an estimate of a transmitted codeword is 
always given) soft decision maximum likelihood closest 
coset decoding [5]. 

Let denote the probability that the i-th stage de- 
coder makes an error when all previous (i— 1) stages are 
correct. Then, conditioning on the i-th stage decoding 
being in error, we obtain P t = ^ general, 

depends on the codeword at the output of the i-th 
inner encoder. However, for binary linear codes, BPSK 
transmission and the AWGN channel, P& ^ is the same 
for all transmitted codewords [7]. For simplicity, we 
assume that the all-zero codeword is transmitted. In 
this case, the output of each Bj-encoder at the trans- 
mitter is the all-?ero codeword in B t . The received 
(n^n B )- tuple r = (r^\r (2) , . . . ,r (nsJ ) is sectionalized 
into tiq sections each of length n At where n# and n A 
are lengths of outer and inner code respectively. 

E11 error at the i-th stage, when CCD is used, occurs 
if a nonzero codeword b = (bi , 60, . . . , &n B ) £ B * has 
larger correlation than the all-zero codeword z 6 B,. 
Let a J = (a{,a ; 2 ,...,a J n J be the codeword with the 
best correlation in the coset corresponding to j-th sym- 
bol 6y, for j = 1,2, . . ., nfl. If a standard BPSK map- 
ping (0 — 1,1 1) is applied, then the difference 

in metrics of b and z can be bounded by the following 
expression: 

M(b) - M(z) < E (-2rf), (1) 

where M(-) denotes correlation metric. Thus the prob- 
ability of error at the i-th stage is given by 

P (i) = P{M(h) - M( z) > 0. b € Bj, b £ 0). (2) 



For a particular a and b, 


n b 

T(a,b)= £ £ (—2rj) 

j = l,bj?0 l=l a J = l 

is a Gaussian randome variable with mean /z and 
variancecr 2 given by 

M = ^2 w (*)> = 4 u;(a) 

b 3 pLO b } ?tO 

where A is the amplitude of a transmitted BPSK signal 
(SNR=10 log (A 2 /2 k/n)). It then follows that 


p(i) < EE E P(T(a,b)>0) 

beBj *j^°a6ft(b) 


= EE E 

beBj ^€0(6^) 



(3) 


where D(&j ) represents the coset corresponding to the 
symbol 6j, and tu(-) the weight of a codeword. 

Note that for any 0 ^ b £ B, 0 ^ a E A,-, 
Ylbj ? £0 u; ( a ) ^ ^min(G) 

Definition 1: Define Sj(y) as the number of terms 
of the form A erfc (Ay/Y/2j satisfying 
i - y = ^( a ) 

2. beB it aen(6j) 

The union bound on the probability of error at the 
i-th stage of decoding is then given by 

p e l) < EE ^OO^erfc ( Ay/Y/2 ) (4) 

where the summation is taken over all Y defined above. 
Thus 


P ‘<E X>< y )5«fc(VS72). (5) 

Si(Y) depends on the symbol weight distribution of 
the i-th outer code B t , the Hamming weight distribu- 
tion of the MLC code C and critically on the weight 
distribution (of n^-tuples) in the cosets of the parti- 
tion Ai/Ai+i. For inner codes of small lengths the 
coset weight distribution can be evaluated easily by an 
exhaustive computer search program or using combi- 
natorial methods. 

III. Results for some MLCC codes 
Consider the decomposition 

RM(64, 42, 16) = {(8, 1. 8)(8, 4, 4) 3 , (8, 7, 2) 3 (8, 8. 1)} * 

{(8,8,1), (8.4,4), (8,0)} 


given in [3]. The symbol weight distribution of B\ is 
'Vbi(O) = 1,^(4) = 98, N Bl (6) = l,176,iV Sl (7) = 
1, 344, Nb^S) = 5,5/3. The symbol weight dis- 
tribution of B 2 is A r £ 3 (0) = 1, Nb 7 ( 1) = 8, 

Nb 7 (2) = 812, N B2 {Z) = 23,576, N Bl ( 4) = 443,030, 
Nb 7 {5) = 5,315,576, Nb 2 (6) = 39, 867, 212, N B ,(7) = 
170,859,368, N B *{ 8) = 320,361,329. The coset 
weight distributions of A^/Ao and A 2 /{0} are given 
in Table 1. 

To compute Si(Y) with i = 1,2 , we have to deter- 
mine how many n A ns - tuples of weight Y can result 
from a b G of symbol weight u;j(B^) = X. In 
order to answer this question, we must see if a parti- 
tion of the integer Y into X parts is possible with each 
part belonging to a valid entry in the coset weight dis- 
tribution of the partition (8,8, l)/(8,4,4). A partition 
of the form 8 = 24 - 3 + 14-2 is invalid because none 
of the non-zero cosets in the above partition have a 
vector of Hamming weight 1 or 3. Furthermore, the 
partitions must be counted in an ordered fashion (i.e., 
we are counting the so called distributions of an in- 
teger), For example, there are 12 distributions of 16 
into parts {4, 4, 2, 6}. The partition table for first two 
nonzero weights in both first and second stage are given 
in Table 2. 

The effective error coefficient in stage- 1 is determined 
as 5i(8) = 98 x 256 = 25,088. The next two error 
coefficients are 5i(10) = 98 x 4 x 4 3 x 8 = 200704 and 
Si(12) = 98 x (6 x 8 2 x 4 2 + 4 4 ) + 4 6 x 1176 = 5444096. 
The higher order error coefficients can be computed in 
a similar manner. 

In the second stage of decoding, the effective error 
coefficient is S 2 (8) = 8 + 784 = 792 and the er- 
ror coefficient S 2 (12) = 18,816. The first three er- 
ror coefficients for this 2- level decomposition of RM 

(64.42.8) corresponding to weights 8, 10 and 12 are 
Si( 8) + 5 , 3 (8) = 25,880 , 5^10) = 200,704 and 
5i(12) + S 2 ( 12) = 5,462,912. 

In summary, for the Reed-Muller code C = 

(64.42.8) , an upper bound (with first two significant 
terms) on the block error probability with two-stage 
decoding is 

P,.2-Sta ,< < 2.5 /O' 4 1 erfc (Ay/s 75) + 

2.0 10 3 1 erfc (Ay/wfi) + 5. 462, 912^ erfc (A/n/2) . 

The complexity of stage-2 can be reduced by employ- 
ing a 3-stage decomposition 


RM(64, 42. 8) = 

{(8, 1. 8)(8, 4, 4) 3 , (8, T. 2) 3 . (8, 7, 2)(8. 8, 1)} * 

{(8,8,1), (8, 4,4), (8, 2, 4), (8,0)} (6) 


for which 



Pt, 3— stage < 2.6 10 4 | erfc (As/8/2) + 
2.0 10 5 i erfc (.4>/l0/2) + 5 - 4 1q6 ^ erfc (- 4 \ / 12 / 2) . 

Notice that the asymptotic loss in decoding gain, due 
to this simplification, predicted from the bound is prac- 
tically zero, and this agrees with actual simulation re- 
sults shown in Figure 1. 

Table 1 


Coset- #’s 

Weight of vector 


0 

2 

4 

6 

8 

0 

1 

- 

14 

- 

1 

1-7 

- 

4 

8 

4 

- 


Table 2 


w 3 (b) 

Partition 

Ni 

N 2 

4 

{2, 2, 4, 4} 

6 

4 a iFr' 

4 

{2, 2,2,6} 

4 

4 J 4 

4 

{2, 2, 2, 4} 

4 

4 d 8 

5 

Not possible 


- 

7 

Not possible 

- 

- 

6 

{2, 2, 2, 2, 2, 2} 

1 

4° 


N\ =Number of partitions. iV 2 = Number of code- 
words in B i * [At/Ai+i] per partition. 

For long codes, even 2-stage CCD cannot be per- 
formed in an optimal manner due to the complexity 
of the trellises involved. Hence it is imperative that 
tight bounds be derived so that the loss of coding gain 
at high SNRs due to m-stage (m > 3) can be predicted. 

The loss in coding gain of 2-stage relative to m- 
stage decoding can be determined based on the er- 
ror coe'hcients derived in the paper. For the code 
C = (128,64,16) RM, and n = 2,3, this is shown 
in Figure 2. 

The corresponding expressions are 

Postage < 13.8 10 6 ^ erfc (Ay/ttfi) + 

5.1 10 9 ~ erfc ^A>/24/2^ 

Pe, 3-stage < 1-6 x 10 9 -erfc (^4\/l6/2^ + 

6.1 x 10 13 --erfc ^Ay/24/2^ 

The three stage decomposition has a trellis complex- 
ity of 512, 128, and 512 states for the first, second, 
and third stage, and the average number of operations 
per information bit is 1263. The most of the computa- 
tional complexity is due to a large and fully conected 
trellis of the third stage, so a way to reduce it is to im- 
plement 4-stage decoding by dividing the third stage 
into two stages. 


The difference between the bounds for 2-stage and 3- 
stage at block error rate 10“ 6 is about 0.6 dB. Since 
the simulation of 2-stage decoding with MLD at each 
stage is not available, we compared the results of 3- 
stage decoding with a 2-stage decoding where the first 
stage is decoded using iterative search. We believe this 
would be very close in performance of optimum 2-stage 
decoding. The difference in performance for this 2- 
stage and optimum 3-stage decoding is 0.6 dB as well. 

The third example is done for the (72,52,8 decom- 
posable code. Bounds for 3 and 4-stage decoding of 
this code are given in Figure 3. 

The state complexities are 2, 512,8, 1 for the 4-stage 
decoding, and {1024,8, 1} for the 3-stage decoding. 

IV. Trade-offs between complexity and 

PERFORMANCE 

We have shown in the previous section that the perfor- 
mance of 2 and 3-stage decoding of (64, 42, 8) code are 
almost identical. However, this is true only for 3-stage 
decomposition (6). Reduction in state complexity is 
not significant, since maximum number of states in the 
first stage remains 128 (the same as for 2-stage decom- 
position) and the maximum numbers of states at the 
second and third stage are 4 and 2 respectively (as com- 
pared to 8 states in 2-stage decomposition). The aver- 
age number of operations per information bit, though, 
is reduced from 393 to 133 (almost three times). 

For another 3-stage decomposition, 

RM(64, 42, 16) = 

{(8, 1, S)(S,4, 4), (8, 4, 4) 2 , (8,7, 2) 3 (8, 8,1)}* 

{(S, 8,1), (8, 6, 4), (8, 4, 4), (8,0)} , (7) 

maximum number of states is reduced to 16 (in the 
second stage), so 8 times compared to two stage de- 
compoition. Unfortunately, this decomposiyion loses 
about 0.5 dB in performance, and the average num- 
ber of operations per information bit is 227. Thus the 
3-stage decomposition (6) outperforms decomposition 
(7), while having smaller computational complexity. 

We can see that the performance of the multistage 
scheme depends mainly on the first decoding stage. 
This suggests that when increasing the number of 
stages, lower level stages should be divided, while the 
first should be kept as long as the number of states 
in the outer code trellis is not too big. In such man- 
ner, the performance degradation will not be large (in 
some cases almost negligible). Since lower level codes 
in multilevel decomposition are high rate codes with 
complex trellis structure, splitting each of them into 
two levels will reduce the complexity significantly (as 
we have seen in the example of RM(64,42,8) code). 
This can be used to determine a good decomposition 
of a code. 





V. Conclusion 

In this paper, we have shown how to bound the per- 
formance of multistage decoding of Multilevel concate- 
nated codes. We presented examples which show that 
the simulation results converge to derived bounds at 
moderate to high signal-to-noise ratios. Therefore, an 
estimate of the difference in performance for differ- 
ent decompositions and different numbers of decod- 
ing stages can be determined. Analysis of reduction 
in complexity when using large number of decoding 
stages has also been discussed. 



Figure 1: Simulation results and performance bounds 
for RM(64,42,8) code 



Figure 2: Simulation results and performance bounds 
for RM(128,64,16) code 
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Abstract — In this paper, the bit error prob- 
ability Pb for maximum likelihood decoding of 
binary linear codes is investigated. The con- 
tribution of each information bit to Pb is con- 
sidered. For randomly generated codes, it is 
shown that the conventional approximation at 
high SNR Pb « {dn/N) ■ P,, where P s represents 
the block error probability, holds for systematic 
encoding only. Also systematic encoding pro- 
vides the minimum Pb when the inverse map- 
ping corresponding to the generator matrix of 
the code is used to retrieve the information 
sequence. The bit error performances corre- 
sponding to other generator matrix forms are 
also evaluated. Although derived for codes with 
a generator matrix randomly generated, these 
results are shown to provide good approxima- 
tions for codes used in practice. Finally, for de- 
coding methods which require a generator ma- 
trix with a particular structure such as trellis 
decoding or algebraic-based soft decision decod- 
ing, equivalent schemes that reduce the bit er- 
ror probability are discussed. 

I. Introduction 

In this paper, we consider the minimization of the bit 
error probability Pb for maximum likelihood decoding 
(MLD) of linear block codes. Although not optimum, 
this minimization remains important as MLD has been 
widely used in practical applications. We assume that 
the information sequence of length I\ is recovered from 
the decoded codeword based on the inverse mapping 
defined from the generator matrix of the code. For 
block codes, the large error coefficients can justify this 
strategy which is explicitly or implicitly used in many 
decoding methods such as conventional trellis decod- 
ing, multi-stage decoding or majority-logic-decoding. 
Therefore, for a particular code and the same optimal 
block error probability, we determine the best encod- 
ing method for delivering as few erroneous informa- 
tion bits as possible whenever a block is in error at 
the decoder output. We first derive a general upper 
bound on Pb which applies to any generator matrix 
and is tight at medium to high signal to noise ratio 
(SNR). This bound considers the individual contribu- 
tion of each information bit separately. For randomly 
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generated codes, we then show that the systematic gen- 
erator matrix (SGM) provides the minimum bit error 
probability. To this end, a submatrix of the generator 
matrix defining an equivalent code for the bit consid- 
ered is introduced. Note that a similar general result 
holds for the optimum bit error probability related to 
the BSC [2]. We finally discuss how to achieve this per- 
formance whenever the systematic encoding is not the 
natural choice, as for trellis decoding [3] or for MLD in 
conjunction with algebraic decoding [4]-[8] . For exam- 
ple, for trellis decoding of the (32,26,4) Reed-Muller 
(RM) code, at low SNR a performance degradation of 
more than 1 dB is recovered with the proposed method. 
Minimizing the bit error probability associated with 
MLD becomes even more important whenever the con- 
sidered block code is used as the inner code of a con- 
catenated coding system [9]. 

II. Bit Error Probability for MLD 
Suppose an (N, A', d//) binary linear code C with gener- 
ator matrix G is used for error control over the AWGN 
channel. Defining 

1 K 

a = 77 y\ p>u), a) 

h U 

where Pb(j) represents the error probability for the j' h 
bit in a block of A' information bits delivered by the 
decoder, we obtain from the union bound 

N 

Pb(j ) < ( v 7) , (2) 

i-dn 

where Q{x) = (tjV 0 ) -1 ^ 2 e“ a t N °dn . We call Wi(j) 
the effective error coefficient associated with the 
j th information bit with respect to the genera- 
tor matrix G. 

We can prove the following theorem. 

Theorem 1 Let represent the number of codewords 
of weight i m the code C generated by G and lei W{(j) 
represent the number of codewords of weight i in the 
subcode generated by the matrix G(j) obtained after 
deleting row-j in G; then 

tii(j) = v-i - Wi(j). (3) 

Theorem 1 depends on the mapping defined by G as 
it implicitly assumes that the inverse mapping corre- 
sponding to G is used to retrieve the information bits 


from the decoded code sequence. Since for a linear 
code, this mapping is one-to-one and thus invertible, 
Theorem 1 is valid for any representation of G } sys- 
tematic as well as non-systematic. Combining (1) and 
(2), the average bit error probability is expressed as 


s ( i 7 H “W) ) Q (A) - ( 4 ) 

i = d H \ j - 1 J 

For a code defined by a matrix G randomly generated, 
we associate with each information bit j £ [1,/v] a 
matrix 


Da(j) = 


0 

/*-! 



(5) 


where 1 and /<*_! represent the all-1 vector and the 
identity matrix of dimension or — 1 respectively. The 
matrix D a (j) is defined as the dependency matrix 
associated with dimension j of the generator 
matrix G . This matrix allows to derive the following 
theorem. 


Theorem 2 Let consider an (N, K) linear block code 
C with a generator matrix generated randomly . Then 
the value W{{j) corresponding to the dimension j with 
dependency matrix D a (j) is well approximated by 


l^J 

i=o 


a 

2 /+ 1 


N-ct 
i — (2/ H- 1) 


( 6 ) 


Theorem 2 indicates that the larger a, the larger the 
corresponding Pi>(j). Consequently, a = 1 gives the 
smallest bit error probability. For this case, D\ = [1] 
which corresponds to a systematic encoding. There- 
fore, the optimum bit error probability for MLD at 
medium to high SNR is achieved by a systematic en- 
coding if the inverse-mapping defined by G is used to 
retrieve the information bits. This strategy is intu- 
itively correct since whenever a code sequence esti- 
mated by the decoder is in error, the best strategy 
to recover the information bits is simply to determine 
them independently. Otherwise, errors propagate. For 
a = 1, (6) becomes [1] 


Wi(j) as 2 ^ « (i/N) I Vi. (7) 

In that case only, at high SNR, the bit error probability 
for MLD follows 

Wi " Q (v^«) • (8) 

For Reed-Muller (RM) codes oflength N < 64, we 
computed the ratios u>d H (j)/wdH corresponding to (6) 
for various forms of generator matrices. In all cases, the 
value computed from (6) is the exact ratio, although 
the weight distribution of RM codes is far from a bi- 
nomial distribution. 


III. Applications 
A. ML trellis decoding 

ML trellis decoding is based on the trellis oriented gen- 
erator matrix (TOGM) of the code considered [3]. If 
this matrix is used for encoding, trellis decoding be- 
comes suboptimum with respect to the bit error prob- 
ability of MLD. We present a simple method to over- 
come this problem. 

Let G% denote the TOGM of the code C%. Then, by 
row additions only, it is possible to obtain the gener- 
ator matrix G of an equivalent code C which contains 
the K columns of the identity matrix. This matrix is 
known as the reduced echelon form (REF). These op- 
erations modify the mapping between information bits 
and codewords, but since no column permutation has 
been realized, each codeword of C is still uniquely rep- 
resented by a path in the trellis of C £ . Therefore ML 
trellis decoding of the received sequence is still possible 
if we use G for encoding. The trellis decoder estimates 
the code sequence which is closest to the received se- 
quence. Then the information bits are easily retrieved 
due to the systematic nature of G. Since no restric- 
tions on G t apply, the matrix G can be obtained for 
any possible trellis decomposition. 

In [10], a specific ML trellis decoding algorithm for 
the (63,57,3) Hamming code is proposed. The decod- 
ing is realized based on a generator matrix in cyclic 
form. It is also shown that an equivalent systematic 
representation outperforms the cyclic form by 0.4 dB 
at the BER 10~ 5 . However, the decoding of the sys- 
tematic code requires an additional step. By process- 
ing the generator matrix in cyclic form as described in 
this section, this additional step can be removed as the 
encoding matrix becomes G = [h-P*]- On the other 
hand, the cyclic structure no longer exits, but the en- 
coder remains very simple. 

Figures 1 and 2 depict the simulation results for the 
(32,16,8) and (32,26,4) RM-codes respectively. For 
both codes, we simulated ML decoding based on the 
REF and the conventional TOGM described in [3], and 
plotted the first term of the union bound derived from 
(4). As expected from the results of Section II, we ob- 
serve a larger gap in error performance for the (32,26,4) 
RM-code. At the bit error rate (BER) 10“ 6 , the gap in 
performance for this code is about 0.2 dB, which is of 
the same order as the difference between closest coset 
decoding (CCD) and ML trellis decoding [11]. Also, 
we observe a much significant gap at high BER of 0.4 
dB for the (32,16,8) code and 1.1 dB for the (32.26,4) 
code. This behavior becomes important if a concate- 
nated coding scheme is used. 

The extension of this method to multi-stage trellis 
decoding does not follow in a straightforward way. In 
general, multi-stage decoding methods exploit the de- 
composable structure of the code considered, so that 
row additions on the associated generator matrix can 



destroy this structure. For example, CCD of |u|u + r|- 
constructed codes exploits the repetition of the u- 
component code [11]. As a result, row additions in each 
component code generator matrix are allowed, but not 
from one matrix to the other. In addition, the propa- 
gation of decoding errors between decoding stages also 
has to be considered when searching for the optimum 
encoding matrix associated with multi-stage decoding. 

B. MLD in conjunction with algebraic decoding 

Several soft decision decoding algorithms in conjunc- 
tion with an algebraic decoder have been proposed [4]- 
[ 8 ]. In general, algebraic decoding is associated with a 
particular generator matrix form Ga* Therefore, if this 
form is used for encoding, the corresponding algorithm 
becomes suboptimum with respect the bit error prob- 
ability of MLD. Algebraic decoding algorithms can be 
divided into two classes, depending on whether the de- 
coder delivers an estimate of the transmitted codeword 
of length N or of the information sequence of length 
K. In the first case, the method of Section A extends 
in a straightforward fashion. Hence decoding of cyclic 
codes can be realized this way. However, a similar 
method is also possible for the second class of algebraic 
decoders. Again, this method is transparent with re- 
spect to algebraic decoding, so that the conventional 
algebraic decoder corresponding to the code consid- 
ered can still be used. This method simply consists 
of recording the row operations processed to obtain G 
in REF form G a and applying the inverse operations 
to the information sequence delivered by the algebraic 
decoder. 

Figure 3 depicts the improvement achieved by this 
method for Chase algorithm-2 with majority-logic- 
decoding for the (64,42.8) RM-code. The proposed 
method outperforms Chase algorithm-2 with conven- 
tional majority-logic-decoding by 0.15 dB at the BER 
1<T 5 . 


C. Concatenated coding 

We consider the concatenated scheme presented in [12] 
where the inner code is a (64,40) subcode of the (64,42) 
RM code and the outer code is the NASA standard 
(255,223) RS code over GF{2 S ). The outer code is in- 
terleaved to a depth of 5. For this scheme, Figure 4 
represents the simulated bit error performance for en- 
coding with the TOGM and the REF. We observe that 
the systematic encoding outperforms the TOGM by 
about 0.2 dB at the BER 10“ 5 . More importantly, we 
also notice that while the error performance curves cor- 
responding to the inner codes differ by a constant value 
due to different error coefficients, the difference in bit 
error probability between the error performance curves 
corresponding to the concatenated system increases as 
the SNR increases. 


IV. Conclusion 

In this paper, we have showed that for many good 
codes, the SGM provides the best bit error probabil- 
ity for MLD when the inverse mapping of the gener- 
ator matrix G is used to retrieve the information se- 
quence. Based on the presented results, we can con- 
clude that a careful choice of the generator matrix be- 
comes important when comparing different optimum, 
near-optimum or suboptimum soft decision decoding 
schemes. Generally, tenths of dB’s separate the bit er- 
ror performance of such schemes, so that a poor choice 
of the generator matrix of one of the scheme may result 
in an important relative degradation. 

By exploiting the fact that modifying the mapping 
between information bits and codewords is transpar- 
ent to the decoder, we modified conventional trellis 
decoding and MLD in conjunction with an algebraic 
decoder so that these schemes achieve the same bit 
error performance as for systematic encoding. Hence 
the decoding becomes independent of the encoding and 
can simply be viewed as a process providing the most 
likely codeword of the codebook. As a result, the de- 
coder structure remains the same as the conventional 
one but in some cases the decoded sequence requires 
an additional simple reprocessing. 
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Figure 1: Simulated and theoretical bit error probabil- 
ities for the (32,16,8) RM code with TOGM and REF. 



Figure 2: Simulated and theoretical bit error probabil- 
ities for the (32,26,4) RM code with TOGM and REF. 



Figure 3: Simulated bit error probabilities for Chase 
algorithm-2 of the (64,42,8) RM code with majority- 
logic-decoding in Boolean and systematic forms. 



(255,223) RS outer code and (64,40,8) inner code, and 
encoding with REF and TOGM. 
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Abstract — This paper is concerned with con- 
struction of multilevel concatenated block mod- 
ulation codes using a multi-level concatena- 
tion scheme [1] for the frequency non-selective 
Rayleigh fading channel. In the construction of 
multilevel concatenated modulation code, block 
modulation codes are used as the inner codes. 
Various types of codes ( block or convolutional, 
binary or nonbinary ) are being considered as 
the outer codes. In particular, we focus on 
the special case for which Reed- Solomon (RS) 
codes are used as the outer codes. For this spe- 
cial case, a systematic algebraic technique for 
constructing g-level concatenated block modu- 
lation codes is proposed. Codes have been con- 
structed for certain specific values of q and com- 
pared with the single-level concatenated block 
modulation codes using the same inner codes. 
A multilevel closest coset decoding scheme for 
theses codes is proposed. 

I. Introduction 

Single-level concatenated trellis coded modulation 
(TCM) for AWGN channel was first introduced by 
Deng and Costello in 1989 [2, 3]. Almost at the same 
time, Kasami et. al. presented a single-level concate- 
nated block coded modulation (BCM) scheme for reli- 
able data transmission over the AWGN channel [4]. Er- 
ror performance of the single-level concatenated TCM 
and BCM schemes for the Rayleigh fading channel was 
first investigated by Vucetic and Lin in 1991 [5] and 
then by Vucetic in 1993 [6]. 

In the single-level concatenated BCM scheme [4], ev- 
ery information bits of inner code are protected by the 
same degree. In the AWGN channel, it is possible to 
design the inner code such that the bit error rate of 
each information bit is almost same. However, in the 
Rayleigh fading channel, it is not easy to design the 
inner code such that the bit error rate of each informa- 
tion bit is almost same. Therefore, in the single-level 
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concatenated BCM system, bit-error-rate of some in- 
formation bits dominates the overall BER of the coded 
system and results in poor bit error performance. How- 
ever, in a multilevel concatenated BCM scheme, it is 
possible to provide different degree of protection to in- 
formation bits with different bit error rates. Outer 
codes in each level are designed to stop the error prop- 
agation to the next level of decoding. 

Multilevel concatenated BCM codes constructed in 
this paper are designed to achieve better bit error per- 
formance than the single-level concatenated BCM code 
by stopping the error propagation to the next level 
decoding. Simulation results show that these codes 
achieve very impressive real coding gains over the un- 
coded reference system and single-level concatenated 
BCM codes using the same inner codes. 

II. Multilevel Concatenated Block Coded 
Modulation Scheme 

The proposed multi-level concatenated coded modu- 
lation schemes are constructed using a multi-level con- 
catenation approach [1]. 

In a g-level concatenated coded modulation system, q 
pairs of outer and inner codes are used as shown in Fig- 
ure 1. In block coding, Reed-Solomon (RS) codes are 
used as the outer codes, and coset codes constructed 
from a block modulation code and its subcodes are 
used as the inner codes. The encoding and decoding 
are accomplished in q levels respectively. 

Outer Code Construction 

For 1 < i < q and 1 < j < m,, let Bij be an (V, A'*) 
RS (or shortened RS) code over GF( 2 P ) with mini- 
mum Hamming distance D{ = N — /\, + 1. In the 2 -th 
level outer code encoder, a (N, K{ , Z?,-) RS code is inter- 
leaved w ith depth m*. For 1 < j < m,-, let Bij rep re 
sent the j- th code among m t - inter leaved RS coder 
After i-th level outer code encoding, symbols from 
GF(2 P ) in each RS code are converted into p-bits bi- 
nary representation. After conversion, m,* jVp-bits are 
stored in the m* by Np array such that every column 
has mi bits and each bit in this column is selected from 
each RS code Bij for 1 < j < m t . Since Bij = B lf k 


for 1 < j, k < m,, let B, = • • • , rep- 

resents an i-th level outer code for 1 < i < q. The i-th 
level outer code encoder is shown in Figure 2. Later, 
these q-sets of RS codes will be used as the outer codes 
in q levels of concatenation. 

Inner Coset Code Construction 

Let Ao be a block modulation code over a certain el- 
ementary signal set 5 with length n, dimension k Q and 
minimum squared Euclidean distance A 0 . We require 
that 

= mi -F m2 + V m q (1) 

From Ao, we form a sequence of subcodes, Ao, Ax, Ao 
i ■ * • j A q) where A q consists of the all-zero codeword, 
i.e. A ff = {0}. The dimension of these subcodes satisfy 
the following conditions: For 1 <i<q, A t is a linear 
subcode of A t *_i with dimension 

ki — ^i_i — m,- (2) 

and minimum squared Euclidean distance A, . From 1 
and 2, we have 


A»_i. Note that each coset in A 0 /Ax/ • * */A ? -i con- 
sists of 2 m « codewords in A 0 . Since A ? = {0}, each 
coset in Ao/Ax/ • ■ -/A q consists of only one codeword 
in A 0 . Hence the minimum squared Euclidean distance 
of each coset is A q = oo. The minimum squared dis- 
tance among the cosets of a coset in Aq/Ai / • • • /A q - X 
modulo A q is A^_x. The above partition process re- 
sults in a sequence of q coset codes, 


A\ ~ Ao/Ai 
■A 2 = A0/A1/A2 


( 4 ) 


A q — Ao/Ai/ • * */A ? 

These q coset codes are used as inner codes in the pro- 
posed g-level concatenated coded modulation scheme. 
This < 7 -level concatenated modulation code C is de- 
noted as follows: 

C^{Bx,B 2 , {Ai,A2j"-,A g } (6) 


k 1 = m 2 + m 3 -f b m q „ l -f m q 

k2 = rn 3 -b m 4 + b m q 

: (3) 

kq-i = m q 

kq = 0 

We also note that A 0 < A x < • • • < A q and that A q 
consists of only the all-zero codeword with A ? = 00 . 

Now we are going to construct q coset codes from 
Aq, A Xi • • • t A q . These q coset codes will be used as 
the inner codes in the proposed <y-level concatenated 
coded modulation scheme. First we partition Ao into 
2 mi cosets modulo A x . Let A 0 /Ai denote the set of 
cosets of A 0 modulo A x . The minimum squared Eu- 
clidean distance of each coset in Ao/Aj is Ai. The min- 
imum squared distance between two cosets in Ao/Ax is 
Ao. Ao/Ax is called the coset code of Ao modulo 
Ax. Next we partition each coset in A 0 /Ax into 2 ma 
cosets modulo A 2 . Let A 0 /Ax/A 2 denote the set of 
cosets of a coset in A 0 /Ax modulo A 2 . It is clear that 
the minimum squared Euclidean distance of a coset in 
Ao/Ax/A 2 is A 2l and the minimum squared distance 
among the cosets of a coset in Aq/Ai modulo A 2 is Ai. 
We call Ao/Ax/A 2 the coset code of Ao/Ax mod- 
ulo Ao. We continue the above partition process to 
form coset codes. For 1 < i < q, let A 0 /Ax/ • • */A t _i 
be the coset code of Ao/Ax/ * • • /A*_2 modulo A t -_x- 
W r e partition each coset in A 0 /Ax/-*-/A i-i into 2 m% 
cosets modulo A^ Then A 0 /Ax/--*/Ai is the coset 
code of Aq/Ai/ ♦••/A i-x modulo A,. The minimum 
squared Euclidean distance of a coset in Ao/Ax/***/ 
Ai is A,, and the minimum squared distance among 
the cosets of a coset in A 0 /Ax/ • • ■ /Ai_ x modulo A, is 


If Ao, Ax, • * • , Af_x have simple trellis diagrams, the 
coset inner codes, Ax, A 2 , * • • , A q , also have simple trel- 
lis diagrams. If the coset inner codes have simple trellis 
structure, then we can use Viterbi decoding to decode 
coset inner codes. This will decrease decoding com- 
plexity of inner codes drastically. 

III. Multilevel Closest Coset Decoding 
A multilevel closest coset decoding for the proposed 
scheme is presented in this section. Each level decod- 
ing consists of the inner closest coset decoding and the 
outer code decoding. At i-th level decoding, the inner 
coset code Ai is d ecoded by using decoded estimates 
from first level decoder to i— 1 th level decoder. In the 
multilevel concatenated scheme, the i—1 th level outer 
code is designed to reduce the error propagation from 
i 1 th level decoder to i-th level decoder. Since de- 
coded information at each level is passed to next level, 
decoding at eachJevel depends on decoded information 
from the preceding level. Therefore, error propagation 
may occur. To reduce the probability of error propaga- 
tion, outer codes must be selected by considering the 
specific channel characteristic. In following sections, 
multilevel concatenated codes are constructed by fol- 
lowing rule. In the Rayleigh fading channel, strong 
outer codes must used for levels where inner codes have 
small minimum symbol and produ ct distances. 

IV. Example 

Consider a two-level concatenated coded 8-PSK mod- 
ulation system for the frequency non-selective Rayleigh 
fading channel. In this system, two 3-level 8-PSK mod- 
ulation codes of length 8 are used a s the inner codes 
and two RS codes over the Galois field GF{ 2 9 ) are 
used as the outer codes as shown in Figure 3. The two 


inner codes are constructed from two 3-level 8-PSK 
modulation codes, Aq = A[(8, 4, 4) * (8, 7, 2) * (8,7, 2)] 
and Ai = A[(8, 1,8) * (8,4,4) * (8,4,4)]. A 0 has di- 
mension 18, minimum symbol distance 2, minimum 
product distance 4, and minimum squared Euclidean 
distance 2.344. And Ai has dimension 9, minimum 
symbol distance 4, minimum product distance 16, and 
minimum squared Euclidean distance 4.688. Ai has a 
very simple trellis structure and can be decoded in ei- 
ther 3 stages, 2 stages, or one stage. Either way, the 
decoding complexity is very simple. 

Since (8,4,4) code is a subcode of (8,7,2) and (8,1,8) 
is subcode of (8,4,4), A x is a subspace of Ao- Now, par- 
tition Ao with respect to Ai. Let Ao/Ai denote this 
partition. Then Aq/A\ consists of 2 9 cosets, denoted 
with 1 < i < 2 9 , modulo Aj. Each coset in Ao/Ai has 
a trellis structure identical to that of Ai. Let [Ao/Ai] 
denote the set of coset representatives of the cosets in 
Aq/Ai. [Ao/Ai] is called a coset code. In the pro- 
posed two-level concatenated coded 8-SK modulation 
system, [Aq/Ai] is used as the first-level inner code and 
Ai is used as the second-level inner code. Let C denote 
this two-level concatenated coded modulation system. 

The two outer codes used in the proposed system are 
the (511,411,101) and (511,499,13) RS codes with sym- 
bols from the Galois field GF(2 9 ). The (511,411,101) 
RS code is used as the first-level outer code and the 
(511,499,13) RS code is used as the second-level outer 
code. Each code symbol can be represented by a binary 
9-bits. 

The spectral efficiency of the above two-level concate- 
nated coded 8-PSK system is 2.0034 bits/symbol. Its 
bit error performance over the frequency non-selective 
Rayleigh fading channel using two-level closest coset 
decoding is simulated and shown in Figure 4. 

For comparison purpose, we construct a single-level 
concatenated modulation code with Aq as inner code. 
Since Ao has 18 information bits, (511,455,57) RS code 
is used as outer code with interleaving depth 2. Each 
symbols over GF( 2 9 ) in outer code is converted into 
an 9-bits. Then form an 511 by 18 array where each 
column consists of 18 bits. Each column is then en- 
coded into a codeword in Aq. Let C(S) denote the 
above single-level concatenated BCM code. The spec- 
tral efficiency of the code C(S) is 2.0034 bits/svmbol. 
The error performance of the coherently detected 8- 
PSK single-level concatenated modulation code C(S ) 
over the Rayleigh fading channel is also shown in Fig- 
ure 4. The two-level concatenated modulation code C 
achieves a 1.4757 dB real coding gain over the single- 
level concatenated modulation code C(S) at a bit er- 
ror rate (BER) 10“ 5 with the same spectral efficiency. 
Also, two-level concatenated code achieves a 32.259 
dB real coding gain at a BER 10“ 5 over the uncoded 
QPSK modulation without bandwidth expansion. 


Other two-, three- and six-level concatenated BCM 
schemes have been devised for the Rayleigh fading 
channel and they all achieve very impressive real cod- 
ing gains over the uncoded reference system and single- 
level concatenated BCM schemes using same inner 
codes. 
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Figure 1 q-leve! concatenated block modulation code encoder 
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Figure 2 The i-th level outer code encoder 


Figure 3 Two-level concatenated block 
modulation code 
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Figure 4. Bit error performance of 8-PSK two-level concatenated 
block modulation code over the frequency non-selective 
Rayleigh fading channel. 















